# Lightweight text generation

ERNIE 4.5 0.3B PT GGUF
Apache-2.0
This model is a GGUF format conversion version of Baidu's ERNIE-4.5-0.3B-PT, supporting Chinese and English text generation tasks.
Large Language Model Supports Multiple Languages
E
wqerrewetw
173
1
Jan Nano 8bit
Apache-2.0
Jan-nano-8bit is an 8-bit quantized version converted from the Menlo/Jan-nano model, optimized for the MLX framework and suitable for text generation tasks.
Large Language Model
J
mlx-community
188
1
Huihui Ai.magistral Small 2506 Abliterated GGUF
The Huihui AI Quantized Model is a quantized version of Magistral-Small-2506-abliterated, dedicated to making knowledge accessible to everyone.
Large Language Model
H
DevQuasar
423
1
Sentientagi.dobby Mini Unhinged Plus Llama 3.1 8B GGUF
This project provides a quantized version of Dobby-Mini-Unhinged-Plus-Llama-3.1-8B, aiming to make knowledge accessible to everyone.
Large Language Model
S
DevQuasar
181
1
Dleemiller.penny 1.7B GGUF
Penny - 1.7B is a quantized version of a large language model dedicated to making knowledge accessible to everyone.
Large Language Model
D
DevQuasar
113
1
Dmindai.dmind 1 Mini GGUF
DMind-1-mini is a lightweight text generation model suitable for various natural language processing tasks.
Text Generation
D
DevQuasar
213
1
Mlabonne Qwen3 0.6B Abliterated GGUF
This is a quantized version based on the Qwen3-0.6B-abliterated model, using llama.cpp for quantization, suitable for text generation tasks.
Large Language Model
M
bartowski
1,455
2
Qwen Qwen3 0.6B GGUF
Apache-2.0
This repository contains GGUF format model files for Qwen/Qwen3-0.6B, quantized by TensorBlock's machines and compatible with llama.cpp.
Large Language Model
Q
tensorblock
905
3
Qwen3 0.6B GGUF
GGUF quantized version of Qwen3-0.6B, suitable for text generation tasks.
Large Language Model
Q
MaziyarPanahi
233.95k
2
Qwen2 96M
Apache-2.0
Qwen2-96M is a miniature language model based on the Qwen2 architecture, containing 96 million parameters and supporting a context length of 8192 tokens, suitable for English text generation tasks.
Large Language Model English
Q
Felladrin
76
2
Tesslate Tessa T1 3B GGUF
Apache-2.0
Tessa-T1-3B is a 3B-parameter large language model based on the Qwen2 architecture, offering multiple quantization versions to accommodate different hardware requirements.
Large Language Model English
T
bartowski
697
6
Llama 3.1 8B RainbowLight EtherealMix GGUF
This is a quantized version in GGUF format based on the Llama-3.1-8B-RainbowLight-EtherealMix model, which facilitates the development of applications related to text generation.
Large Language Model
L
MaziyarPanahi
101
1
Qwen2.5 1.5B Instruct GGUF
The GGUF format file of the Qwen2.5-1.5B-Instruct model, suitable for text generation tasks.
Large Language Model
Q
MaziyarPanahi
183.11k
6
Gemma 2 2b It Abliterated GGUF
Gemma-2-2b-it-abliterated is a 2.2B parameter language model based on the Google Gemma architecture, optimized through quantization for text generation tasks.
Large Language Model English
G
bartowski
10.55k
60
Gemma 2 2b It
Gemma is a lightweight open model series launched by Google, built on the technology used to create Gemini models, suitable for various text generation tasks.
Large Language Model Transformers
G
google
342.64k
1,064
Gemma 2 27b
Gemma is a lightweight open-source large language model launched by Google, built with the same technology as Gemini, suitable for text generation tasks.
Large Language Model Transformers
G
google
11.89k
207
Phi 3 Mini 4k Instruct Bnb 4bit
Other
The 4-bit quantization version of Phi-3-mini-4k-instruct, quantized using the bitsandbytes tool, is designed specifically for fine-tuning.
Large Language Model Transformers
P
leliuga
1,541
4
Qwen1.5 Moe Tiny Random
This is a small randomly initialized model based on the Qwen1.5-MoE architecture, using float16 precision, suitable for text generation tasks.
Large Language Model Transformers
Q
yujiepan
30
1
Phi 2 Super GGUF
MIT
phi-2-super-GGUF is the GGUF quantized version of the abacaj/phi-2-super model, suitable for local execution and text generation tasks.
Large Language Model Transformers
P
MaziyarPanahi
158
5
Minueza 32M Base
Apache-2.0
Minueza-32M-Base is a base model with 32 million parameters, fully trained on extensive English text corpora, suitable for text generation tasks.
Large Language Model Transformers English
M
Felladrin
68
18
Gemma 2b
Gemma is a lightweight open-source large language model series launched by Google, built on the technology used to create Gemini models, offering a base version with 2 billion parameters.
Large Language Model
G
google
402.85k
994
Phi2 Chinese 0.2B
Apache-2.0
A 200-million-parameter Chinese causal language model based on the Phi2 architecture, supporting text generation tasks
Large Language Model Transformers Supports Multiple Languages
P
charent
65
30
Tinyllama V0 GGUF
MIT
TinyLLama-v0 is a lightweight language model provided in GGUF format, suitable for text generation tasks.
Large Language Model English
T
aladar
72
2
Puma 3B
Apache-2.0
Puma-3B is a text generation model fine-tuned based on OpenLLaMA 3B V2. It is trained on the ShareGPT Hyperfiltered dataset and is suitable for various text generation tasks.
Large Language Model Transformers English
P
acrastt
427
4
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase